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ABSTRACT 



Among the different classes of enzymes involved in the ubiquitin pathway, E2 ubiquitin-conjugating 
enzymes occupy a central role in the ubiquitination cascade. Cdc34-like E2 enzymes are character- 
ized by a 12-14 residue insertion in the proximity of the catalytic site, known as the acidic loop. 
Cdc34 ubiquitin-charging activity is regulated by CK2-dependent phosphorylation and the regula- 
tory mechanism involves the acidic loop. Indeed, the phosphorylation stabilizes the loop in an open 
conformation that is competent for ubiquitin charging. 

Cdc34 is associated with a variety of diseases, such as hepatocellular carcinomas and prostatic 
adenocarcinomas. In light of its role, the discovery of potential inhibitory compounds would provide 
the mean to effectively modulate its activity. 

Here, we carried out a computational study based on molecular dynamics, virtual screening and 
docking to identify potential inhibitory compounds of Cdc34, modulating the acidic loop conforma- 
tion. The molecules identified in this study have been designed to act as molecular hinges that can 
bind the acidic loop in its closed conformation, thus inhibiting the Cdc34-mediated ubiquitination 
cascade at the ubiquitin-charging step. In particular, we proposed a pharmacophore model featur- 
ing two amino groups in the central part of the model and two lateral aromatic chains, which 
respectively establish electrostatic interactions with the acidic loop (Asp 108 and Glu 109) and a 
hydrogen bond with Ser 139, which is one of the key residues for Cdc34 activity. 
© 2014 The Authors. Published by Elsevier B.V. on behalf of the Federation of European Biochemical Societies. This 
is an open access article under the CC BY-NC-ND license (http://creativecommons.Org/licenses/by-nc-nd/3.0/). 



1. Introduction 

Ubiquitination is a post-translational modification that was 
originally known as a signal for protein degradation by the 265 
proteasome [1 ]. It is also involved in many other different signaling 
pathways, including cell cycle, endocytosis, transcription, DNA 
repair, signal transduction, apoptosis and the immune response 
[1-4]. 

In the ubiquitination pathway, E2 enzymes charged with ubiq- 
uitin (Ub) can be recruited by an E3 Hgase, along with the target 
substrates. The C-terminal glycine of Ub can then be attached to 

Abbreviations: MD, molecular dynamics; Sc, Saccharomyces cerevisiae; 
Ub, ubiquitin; UBC, ubiquitin-binding domain 
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a lysine residue on the target substrate. This can result in the trans- 
fer of only a single Ub molecule (mono-ubiquitination) or the addi- 
tion of further Ub molecules to form a poly-Ub chain. Depending 
on the target lysine used to cross-link the Ub molecules in the 
chain, different poly-Ub chains can be formed, which adopt diverse 
three-dimensional (3D) structures and exert different biological 
effects [5-8]. 

Ubiquitination can thus be described as a molecular zip code, 
which is used to sort different ubiquitination products to different 
destinations. Errors in delivery of ubiquitinated proteins to the 
proteasome or other destinations are highly detrimental for the 
cell [3]. 

E2 enzymes (E2s) have a primary role in catalyzing, alone or 
with the cognate E3, the covalent attachment of Ub to the target 
proteins and they have a major role in defining the topology of 
the polyUb chain and thus the fate of the substrate [9]. E2s are 
often multi-domain proteins that all share a conserved Ub-binding 
domain (UBC characterized by a a/p fold) [9-11]. The highly 
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conserved active site Cys is located in a shallow groove of the UBC 
domain formed by a short loop connecting a-helix 2 and ot-helix 3 
and a long loop (p4-ot2 loop) proximal to the active site [9-11]. 

E2 UBC domains have been recently classified by Michelle et al. 
[12] into 17 families of homologs by performing phylogenetic anal- 
yses on 207 E2 genes belonging to seven different species. Family 3 
members, i.e. Cdc34-like enzymes are characterized by a conserved 
and disordered insertion in the p4-ot2 loop in the proximity of the 
catalytic site. The insertion is known as the acidic loop of Cdc34- 
like enzymes. 

In a previous work, we identified two co-evolving signature ele- 
ments in Cdc34-like E2 enzymes: the acidic insertion in p4-ot2 loop 
in the proximity of the catalytic cysteine and two conserved CK2 
phospho-sites within the UBC domain [13]. We previously demon- 
strated by combining Ub-charging assays and MD simulations that 
the phosphorylation at one of this sites (SI 30) can modulate the 
opening and closing of the p4-ot2 loop with respect to the catalytic 
cleft and, in turn, it modulates the accessibility of the catalytic Cys 
for Ub-charging [13]. More in details, this regulatory mechanism 
relies on electrostatic repulsive effects between the phosphory- 
lated serine and the acidic residues in the p4-ot2 loop. The loop 
can undergo a substantial shift and drift away from the catalytic 
cleft upon phosphorylation, promoting the accessibility of the cat- 
alytic Cys. 

Cdc34 is known to be involved in a variety of diseases, such as 
hepatocellular carcinomas and prostatic adenocarcinomas [14-19]. 
In fact, Cdc34 stimulates cellular proliferation by enhancing the 
degradation of p53 and p27, which both act as inhibitors of cell 
cycle progression. 

In light of the above scenario, recent studies have been focused 
in the design and identification of inhibitory molecules of E2 
enzymes and of Cdc34 in particular. A small inhibitor (CC0651) 
was identified for Cdc34 [20,21]. CC0651 was able to inhibit prolif- 
eration of human cancer lines and caused accumulation of the p27 
substrate. Another small inhibitor was identified for Rad6 E2s [22] 
with the capability to inhibit the thioester formation between the 
E2 catalytic cysteine and the C-terminal of Ub. These studies have 
opened the venue to study E2 enzymes as a suitable class of drug 
targets in the ubiquitination pathway. 

We contribute to this scenario, proposing a group of molecules 
with the potential to directly counteract the Ub-charging activity 
of Cdc34 by acting on the acidic loop (p4-ot2 loop) and keeping it 
in a closed conformation, shielding the catalytic cysteine needed 
for the attachment of the Ub molecule. The task was carried out 
by the integration of different computational approaches, as 
described by Sanders et al. [22]. In particular, we used a structural 
ensemble that was already available thank to our previous MD 
studies [13,23] to isolate the most representative conformations 
suitable for docking simulations. Subsequently, virtual screening 
and docking were performed to select putative compounds from 
735,758 entries of the ZINC database [24]. We selected 20 mole- 
cules by both energy-based and structural-based screening of 
docking simulations of 500 compounds. We then provide a phar- 
macophore model with the aim of inhibiting Cdc34 Ub-charging 
activity acting as molecular zipper to stabilize the closed and inac- 
tive conformation of the acidic loop. The results here described can 
provide a valuable dataset for future experimental studies in the 
field. 

2. Materials and methods 

2A. Rmsd-matrices and clustering 

The Cdc34 MD ensemble previously published [13] along with 
an increased sampling achieved by performing new simulations 



[23] was used as a reference conformational ensemble for the pres- 
ent investigation. In particular, we post-processed the ensemble by 
rmsd-based structural clustering as described in the following. 

C-alpha (Cot) root mean square deviation (rmsd) were calcu- 
lated pairwise for each pair of frames of the available MD ensem- 
ble, collecting values ranging from 0.2 to 0.65 nm, indicating 
respectively nearly identical or highly diverging structures in the 
ensemble. The highest rmsd values are associated to a displace- 
ment of the acidic loop conformations, whereas secondary struc- 
tures were conserved in all the MD structures and characterized 
by rmsd values lower than 0.25 nm [13]. The Ca rmsd matrix 
(Fig. IS) was then processed to obtain structural clusters of similar 
conformations using the Gromos algorithm implemented in Gro- 
macs (www.gromacs.org) with a clustering cutoff of 0.35 nm. 

2.2. Virtual screening and docking simulations 

The virtual screening and the docking calculations were per- 
formed with DOCK Blaster [25] and Autodock version 4.2 [26], 
respectively. DOCK Blaster is an online server that selects and 
scores thousands of compounds deposited in the ZINC database 
[24] for a target structure uploaded by the user. The center of mass 
of three residues (PI 1 0, II 37 and Nl 38) was used to set the grid for 
the calculation. In particular, we selected the ZINC subset 11 [24], 
containing 735,758 entries. Indeed, compounds belonging to this 
subset are described as lead-like and were selected to obey to 
the Lipinski rule [27], according to which an orally active drug 
has no more than one violation of the following criteria: not more 
than five hydrogen-bond (H-bonds) donors (nitrogen or oxygen 
atoms with one or more hydrogen atoms), no more than ten H- 
bond acceptors (nitrogen or oxygen atoms), a molecular weight 
under 500 Da and an octanol-water partition coefficient log P less 
than 5. 

Once the virtual screening procedure was completed, we 
employed Autodock version 4.2 [26] for docking calculations of 
the first 500 compounds selected by DOCK Blaster energy-rank. 
For each molecule, DOCK Blaster provided one binding pose char- 
acterized by the lowest energy according to the DOCK Blaster 
energy function. The DOCK Blaster binding pose was used as a 
starting structure for docking simulations with Autodock for each 
molecule (500 Autodock simulations overall). The parameters used 
for Autodock simulations are reported in the Supplementary 
Table IS. 

Autodock provided different binding poses for each of the 500 
molecules simulated. For each molecule, the binding poses gener- 
ated by Autodock are at first clusterized by the Autodock internal 
routines. For each cluster referred to a specific molecule, the soft- 
ware generally returns the binding pose with the lowest energy 
(cluster binding pose). We then applied a further selection proce- 
dure by the Pymol Python APIs (application programming inter- 
faces) on all the cluster binding poses provided by Autodock for 
each molecule. In particular, we used both energetic and struc- 
tural criteria in this final selection step. The final goal is to obtain, 
for each of the 500 screened molecules, a unique binding pose 
that is not only characterized by the lowest energy but also that 
can structurally act as a molecular zipper contacting both the 
acidic loop and the surrounding region in the catalytic cleft, as 
also discussed in the Results. With this aim in mind, we ranked 
by the Python routine with the Pymol APIs the cluster binding 
poses referred to the same molecule according to their energy. 
Among the two cluster binding poses of a molecule that are char- 
acterized by the lowest energy, the API routine selects the one 
that has a center of mass at the minimal distance from the bind- 
ing site (defined using as reference residues PI 10, 1137 and 
N138). 
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3. Results 

3.1. Selection of the starting structure for virtual screening 

To select a suitable Cdc34 structure for docking simulations, we 
post-processed the previously published MD ensemble of Cdc34 
[13,23]. 

A Cot rmsd matrix (Fig. IS) was built from the MD ensemble to 
evaluate the conformational variability among the collected struc- 
tures. A clustering algorithm was then used to extract from the 
rmsd matrix different clusters of similar conformations (Fig. IS, 
see Section 2 for details). In particular, we identified a highly pop- 
ulated cluster that also includes the conformations from the most 
populated basin of the previously published free energy landscape 
[13]. The Cdc34 structure selected for virtual screening and dock- 
ing simulations is the average conformation associated with this 
cluster. Indeed, it can be considered representative of the closed 
and inactive conformation of Cdc34, i.e. with the acidic loop in a 
closed conformation and the catalytic cysteine in a buried position 
(Fig. 1). 

3.2. Selection of a binding site for the inhibitors 

The binding site was selected according to the position and ori- 
entation of the acidic loop. We aimed at identifying molecules that 
can act as molecular zippers, i.e. able to stabilize the loop in the 
closed and inactive conformation so that the Ub-charging activity 
can be impaired. Thus, a potential inhibitor should establish strong 
intermolecular interactions with the acidic residues in the p4-ot2 
loop and, at the same time, it should be able to bind the surround- 
ing regions of the catalytic cleft to maintain the loop in the closed 
conformation. In particular, we selected three residues of Cdc34 
(PI 10, 1137 and N138) whose center of mass corresponds to a cav- 




Fig. 1. Cdc34 three-dimensional structure. The acidic loop is represented in yellow, 
while the catalytic cysteine in red. The three residues (PllO, 1137 and N138) 
selected to define the binding site for virtual screening are depicted in cyan (right 
panel). (For interpretation of the references to color in this figure legend, the reader 
is referred to the web version of this article.) 



ity located in the close proximity of the catalytic cysteine and the 
acidic loop (Fig. 1). The selection of this binding site was essential 
to establish a spatial grid as an input for the docking algorithm that 
we employed in the first screening step (DOCK Blaster). Among the 
735,758 molecules in the ZINC database (ZINC subset 11), the soft- 
ware returned 500 compounds ranked according to their binding 
free energies. 

DOCK Blaster parameters are optimized for virtual screening 
calculations and thus they only permit a fast but coarse selection 
among the large number of molecules deposited in the ZINC subset 
1 1. Thus to refine our data, we performed docking simulations with 
Autodock for each of the 500 molecules returned by DOCK Blaster 
(see Section 2). In particular, these 500 compounds were docked 
against Cdc34 structure, and the Autodock grid was built around 
the selected binding site in Fig. 2S, which also corresponds to the 
same search space used for the DOCK Blaster calculations. 

From a first visual inspection of the binding poses selected by 
Autodock internal routines (cluster poses, see Section 2 for details), 
it was not possible to detect any common feature that could be 
used to build a pharmacophore model (Fig. 2 and Table 1). There- 
fore a subsequent selection step was applied to filter out the mol- 
ecules that did not properly interact with Cdc34, In particular, we 
analyzed the clusters calculated by Autodock for each different 
compound to select molecules that may either bind Cdc34 with 
high affinity (poses that have the lower energies in the ranking) 
and act as molecular hinges/zippers by both interacting with the 
acidic loop and the surrounding structural components (poses at 
the lower distances from the binding site) (see Section 2). The 
poses of each of the 500 molecules were classified into three bins 
according to their distance from the binding site (Fig. 3) so that 
molecules belonging to the first bin had a center of mass at a dis- 
tance shorter than 0.8 nm from the center of mass of the previously 
defined binding site. This step is of crucial importance, since the 
inhibitors are expected to act as molecular hinges and therefore 
need to establish interactions with residues at the opposite sides 
of the acidic loop. According to these criteria, we selected a binding 
pose for each of the 500 molecules (see Section 2 for further 
details). After this post-clustering ranking procedure, the first 20 
lower energy molecules (Fig. 4, left panel) are all located in the 
same area of the binding site and share recurring structural 
features that are discussed in details below. 




Fig. 2. Autodock results upon clustering with Autodock internal routines. The first 
20 molecules from the Autodock energy-ranking list are depicted with different 
colors. The residues of the acidic loop are shown as yellow sticks. (For interpretation 
of the references to color in this figure legend, the reader is referred to the web 
version of this article.) 
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Interestingly, an overlap can be observed between the Autodock 
clustering and the post-clustering selection procedure that we 
applied. Indeed, the first nine molecules belonging to the first bin 
of the post-clustering procedure were also found in the Autodock 
ranking, as shown in Table 1. Nevertheless, the ranking order after 
post-clustering selection with API tools is significantly modified. In 
particular, the molecules belonging from 9th to 20th position in 
the post-clustering ranking do not appear in the first 20 positions 
of the classical Autodock energy-based ranking (Table 1 ). 

We emphasize the need for a spatial clustering procedure by 
showing in Fig. 5 the different positions for two molecules used 
as examples of the two possible scenarios, i.e. a case that is selected 
by the post-clustering ranking (left panel) and a case that is dis- 
carded upon this procedure (right panel). The left panel illustrates 
the orientation of the molecule C08743791 (4,6-dimethyl-N-[3-(5- 
phenyl-lH-pyrazol-3-yl)-lH-l,2,4-triazol-5-yl]pyrimidin-2-amine), 



whereas the right panel shows the molecule C20209924 (5- 
methyl-N-{2-[3-(3-pyridinyl)-lH-l,2,4-triazol-5-yl]ethyl}-2-indo- 
linecarboxamide). The first molecule has an orientation suited for 
interaction with both the acidic loop and the surrounding struc- 
tural components (VI 43 and VI 47), the latter is only interacting 
with the acidic loop. Though the energetic estimation for the sec- 
ond molecule is on average comparable to the first (~0.5 kcal/ 
mol), the spatial analysis suggests that it cannot act as a proper 
molecular hinge, since by only interacting with the acidic loop it 
cannot properly keep the loop in a closed conformation. For these 
reasons, the molecules presenting a binding mode similar to that 
of 5-methyl-N-{2-[3-(3-pyridinyl)-lH-l,2,4-triazol-5-yl]ethyl}-2- 
indolinecarboxamide were filtered out. 

In Fig. 4 (right panel), two amino groups are shown to be a 
recurrent feature of the molecules identified after post-clustering 
selection, which (as depicted in Fig. 6 for molecule 4,6-dimethyl- 



Table 1 

Docking results with Autodoci< before and after the clustering procedure. 



Docking ranking before clustering 



Docking ranking after clustering 



Zinc ID kcal/mol 



Zinc ID kcal/mol 



C16347514-8.71 

N-(lH-l,2,4-tnazoI-3-y I X{2-methy I-5- [(N-(lH-l^^^^ 
yI)carbamoyl)ami no]phenyI}amino)carboxamide 



C08743791 -8.32 

4,6-dimethyI-N-[3-(5-phenyI-lH-pyrazoI-3-y l )-lH-l,2,4-M^^ 




C08743791 -8.32 4.6-dimethvI-N-[3-(5-phenvl-lH-pvrazoI-3-vl)- C29447249 -8.22 

lH-1.2.4-triazoI-5-yI]pyrimidin-2-amine 6-imidazoI-l-yI-N-[3-(5-methyItetrazol-l-yI)phenyI]pyridine-3-carboxamide 





C09367105 -8.29 C14048487 -8.06 

N-[4-(lH-benzoimidazoI-2-yIcarbamoyI)phenyI]furan-2-carboxamide N-(lH-benzimidazoI-2-yI)-5-cycIopropanecarbonyIamino)thiophene-2- 

carboxamide 





C31808914-8.24 

2-benzyl-2.3.4.6-tetrahydro-lH- 

[1.3.5]triazino[l'.2':3.4][1.3.5]triazino[1.2-a]benzimidazole 



C29374529 -7.98 

3-phenyl-5-[[4-(tetrazol-l-yl)phenoxy]methyl]isoxazole 
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Table 1 {continued) 



C29447249 -8.22 6-imidazol-l -yl-N-[3-(5-methyItetrazol-l - 
yl)phenyl]pyridine-3-carboxamide 



C33082602 -7.86 

N-[4-methyI-3-(tetrazoI-l-yI)phenyl]-4-pyrazoI-l-yI-benzamide 





C14048487 -8.06 C17304375 -7.78 

N-(lH-benzimidazol-2-yl)-5-(cyclopropanecarbonylamino)thiophene-2- N-(lH-benzimidazol-2-yl)-5-phenyl-lH-pyrazole-3-carboxamide 



carboxamide 



C29374529 -7.98 

3-phenyI-5-[[4-(tetrazoI-l-yI)phenoxy]methyI]isoxazoIe 




C33082938 -7.59 N-[4-methyl-3-(tetrazol-l-y l ) phen yl]-4-Cl,2,4-triazol-l- 
yl)benzamide 




C23141050 -7.96 4- [(8-methyIimidazo[l,2-a]pvndin-2- C13018588 -7.43 

yl)methoxy]-N-(4H-1.2.4-triazoI-3-yI)benzamide 4-cyano-N-[5-(p-toIyn-4H-1.2.4-triazoI-3-yI]benzamide 





C33082602 -7.86 

N-[4-methyI-3-(tetrazoI-l -yI)phenyl]-4-pyrazol-l -yl-benzamide 




C20539931 -7.38 

N-(2J-dihydro-lH-inden-2-yn-5-[3-(methoxymethyIl-1.2.4-oxadiazoI-5-yI]-2- 
pyridinamine 



{continued on next page) 
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Table 1 {continued) 



C20855376 -7.82 C33066448 -7.32 

l-(S-benzyI-2H-l,2,4-triazol-3-yI)-3-(l,4,5,6- N-C2-oxo-3A-dihydro-lH-quinolin-6-yl)-5-phenyl-lH-pyrazole-3-carboxamide 
tetrahydrocyclopenta[d]pyrazoI-3-ylmethyl)urea 




C17304375 -7.78 N-(lH-benzimidazoI-2-vn-5-phenvI-lH-pvrazoIe- C12548747 -7.29 

3-carboxamide N-(lH-benzimidazol-2-yI)-4-ureido-benzamide 




C08115937 -7.75 C08116504 -7.25 

4-cyano-N-[5-(4-methoxyphenyI)-4H-1.2.4-triazoI-3-yI]-benzamide N-[5-(4-fIuorophenyI)-4H-1.2.4-triazoI-3-yI]-4-methoxy-benzamide 




C20209924 -7.75 C14115531 -7.23 

S-methyl-N-{2-[3-(3-pyridinyn-lH-1.2.4-triazoI-5-yI]ethvI}-2- 2-(4-cyanophenoxy)-N-(4-methyl-2-oxo-chromen-7-yl)acetamide 
indolinecarboxamide 




C33082938 -7.59 C23505593 -7.16 

N-[4-methyI-3-(tetrazoI-l-yI)phenyl]-4-(1.2.4-triazoI-lvnbenzamide 3-(6.7-dihvdro-5H-pyrroIo[12-d][1.2A]triazoI-3-yImethyI)-l-(5-isobutyl-4H-1.2.4- 

triazol-3-yl)urea 
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Table 1 {continued) 



C26259081 -7.48 C05262984 -7.13 n/ 

4- [(6-methyIimidazo[1.2-a]pyridin-2-yI)methoxy]-N-(4H-1.2.4-triazoI-3- 
yl)benzamide 




C 17724473 -7.45 

N-(4-chlorophenyI)-4-oxo-l,2.3,4-tetrahydropyrimido[1.2- 
a]benzimidazoIe-2-carboxamide 





C12772985 -7.11 

N-[5-(3,4-dimethoxyphenyI)-4H-l,2,4-triazoI-3-yI]-2-hydroxy-pyridine-3- 
carboxamide 




C13018588 -7.43 

4-cyano-N-[5-(p-toIyI)-4H-1.2.4-triazoI-3-yI]benzamide 




C23411539 -7.11 

N-cycIohexyI-2-propyI-1.2.3.4-tetrahydropyrazino[1.2-a]benzimidazoIe-8- 




C33238037 -7.41 

[4-(2-pyhdyImethyl]piperazin-l-yI]-[4-(l,2,4-triazoI-l- 
yl)ph enyl Jmethanone 



C09738752 -7.10 

N-[5-(3,4-dimethoxyphenyI]-lH-l,2,4-triazoI-3-yI]-4-methyI-benzamide 




{continued on next page) 
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Table 1 {continued) 



C20539931 -7.38 C06971298 -7.03 

N-(2.3-dihydro-lH-inden-2-yI)-5-[3-(methoxymethyI)-1.2.4-oxadiazoI-5- N-(lH-benzoimidazoI-2-yI)-5-fIuoro-benzothiophene-2-carboxamide 
yI]-2-pyridinamine 





C21835537 -7.37 

l-(5-methylisoxazol-3-y l)-3- [ClR)-l-[4-(l,2A-triazol-l- 
yl)phenyl]ethyl]urea 



C29376412-7.01 

N-[4-(cycIopentoxy)phenyI]-4-(tetrazoI-l-yI)benzamide 





N-[3-(5-phenyl-lH-pyrazol-3-yl)-lH-l,2,4-triazol-5-yl]pyrimidin- 
2-amine) establish two H-bonds with the acidic residues in the 
loop. 

Another recurrent feature of the selected molecules is the pres- 
ence of an aromatic ring that is shown to interact with SI 39 (Fig. 7). 
Indeed, Cdc34 SI 39 was identified as a critical determinant for sub- 
strate mono-ubiquitination and polyubiquitination via K48-linked 
ubiquitination by experimental mutagenesis [28]. Moreover, it is 
the homologous residue of Ubc7 D127 that is known to play a cru- 
cial role in positioning the acceptor lysine of the substrate for the 
nucleophilic attack of the thioester bond during catalysis [29]. 

The aromatic ring of the scaffold (compound C08743791 [4,6- 
dimethyl-N-[3-(5-phenyl-lH-pyrazol-3-yl)-lH-l,2,4-triazol-5- 
yl]pyrimidin-2-amine] is reported as proof of concept in Fig. 8) is 
shown to interact with two hydrophobic residues in helix ot3 of 
Cdc34: VI 43 and VI 47. This interaction might account for the 
specificity of the compounds. The valine steric hindrance can also 
limit the size of the inhibitor into the binding site. 

In summary, on the basis of these common chemical features, 
we propose a pharmacophore model (Fig. 9) which features three 
main regions: a central region constituted by two amino groups 
which interact with the acidic residues of the loop, and two lateral 
regions that are characterized by the recurring presence of aro- 
matic groups. 

4. Discussion 

The ubiquitin pathway is responsible for protein regulation by 
promoting target substrates degradation via the 26S proteasome 
system, and its deregulation has been implicated in a variety of dis- 
eases, including the onset of cancer and neurodegenerative disor- 
ders. In light of this crucial role, ubiquitination mechanisms have 
been thoroughly investigated to discover new leads to modulate 
cell cycle progression, as well as to find suitable molecular targets 
for cancer treatment. 



In particular, E2 enzymes have been shown to play a central role 
in ubiquitination, since they act as mediators in determining the 
ultimate fate of the substrates. In this context, they have been 
recently proposed as suitable targets for inhibitory compounds 
[20-22]. 

In this context, we here focused on the human E2 enzyme 
Cdc34, which has been directly linked to tumorigenesis [14-19]. 
In particular, we used here as a model system, the homolog of 
Cdc34 in yeast, Saccharomyces cerevisiae Cdc34 (ScCdc34 or 
Ubc3). Indeed, this enzyme has been extensively characterized by 
our and other works [13,23,30-33] and an MD ensemble of confor- 
mations for ScCdc34 was already available [13,23]. ScCdc34 repre- 
sents a valuable model since it shares a high sequence similarity 
with its human homolog (Fig. 3S). It has also been used as a model 
for studying the relationship occurring between phosphorylation 
and regulation of the E2 enzymatic activity [13]. We here focused 
on the Cdc34 acidic loop as a possible target of action for inhibitory 
molecules, which can constrain its position in the proximity of the 
catalytic cysteine, impairing Cdc34 activity. Indeed, the current 
data available on Cdc34 [13,23,30-33] support that Cdc34 can pop- 
ulate both states in which the loop is in a 'closed' conformation and 
the catalytic cysteine not accessible for Ub-transfer by the El 
enzyme, and active 'open' state in which the loop is open or par- 
tially open and the catalytic cysteine in a solvent-exposed position 
that can thus be compatible with Ub-charging. Those active open 
states, which are likely to be present even in absence of phosphor- 
ylation, are suggested to be further promoted by phosphorylation 
of Cdc34 catalytic domain by a population-shift mechanism 
[13,23]. The aim of our work is to identify molecules with the 
potential of entrapping the inactive states of Cdc34, i.e. a confor- 
mation of the protein in which the loop is stabilized in a closed 
state and the catalytic cysteine is thus buried and not accessible 
for Ub-transfer. 

In this work, MD simulations of ScCdc34 were thus analyzed to 
select a conformation suitable for docking simulations. A large 
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Fig. 4. Molecules identified by the post-clustering ranking according to both structural and energetic criteria. In the left panel, the first 20 high-ranking molecules after 
clustering and a portion of Cdc34 are depicted with surface representation of molecule C08743791. In the right panel, the high-ranking clustered molecules are shown 
superimposed to underline the presence of two recurrent amino groups. The selection step has been carried out post-processing Autodock output with Phyton Pymol API 
tools. 



subset of the ZINC database was used for high-throughput virtual 
screening against ScCdc34 structure, followed by docking simula- 
tions with Autodock to refine the results. More in details, the pipe- 
line that we have followed is constituted by a first step of coarse 
molecular selection based on shape complementarity performed 
by DOCK Blaster, screening the ZINC database. In a second step 
500 compounds selected at the first step were re-docked in the 
Cdc34 binding site by Autodock and a filtering procedure was 
applied based on both spatial criteria and binding free energy to 
identify 20 molecules. It has to be noted that all the molecules 
selected from our in silico screening come from the ZINC database 
and comply with the Lipinski rule of five [27], so that for each 
compound the octanol-water partition coefficient is not greater 
than 5 and the molecular mass is less than 500 Da. These two fea- 
tures, along with a number of hydrogen-bond donor/acceptor 
groups lower than five make the selected molecules likely to dif- 
fuse inside cell membranes, so that they can exert their biological 
activity. 

In the first part of the work, we performed a preliminary screen- 
ing on thousands of compounds from a subset of the ZINC database 
using DOCK Blaster. One could argue that 500 molecules are not a 
sufficient dataset to further investigate. In this context, it has to be 
noted despite being less accurate than Autodock in its predictions, 
DOCK Blaster recapitulates the crystal ligand pose within 2 A rmsd 
50-60% of the time in common benchmarks [25]. We here 



analyzed in details the structural features which the majority of 
the 500 hundred molecules returned by DOCK Blaster have in com- 
mon. These features are more clearly distinguishable after the fol- 
lowing screening performed with Autodock and the spatial 
filtering (Fig. 9). Hence, the presence of distinct common chemical 
and structural determinants in the subset of molecules selected by 
DOCK Blaster supports reliability of its results, so that it is unlikely 
that from a random selection of ZINC molecules we could have 
obtained similar results. 

In the 20 molecules identified in our screening, a set of common 
structural features can indeed be identified used to design a phar- 
macophore model. From an energetic point of view, the most rele- 
vant intermolecular interactions are established by two amino 
groups, which form two H-bonds with carboxylic groups from 
the acidic loop (D108 and El 09). These interactions account for 
much of the calculated binding free energy, and therefore have 
been used by the energy-based ranking as the main discriminating 
feature for molecular selection. To quantitatively assess the impor- 
tance of these hydrogen bonds on the binding free energy for each 
molecule, we also separately evaluated each energy component. As 
shown in Fig. 10, for the first four high-ranking molecules, the 
energy contribution of the hydrogen bonds accounts for more than 
80% of the free energy of binding, as expected. In addition to the 
interactions established with the acidic loop, the selected com- 
pounds share a recurrent hydrogen bond with the Ser 139, that 
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Compound C08743791 Compound C20209924 



Fig. 5. Examples of binding poses for the molecules. The left panel and right panels show the orientation of molecule C08743791 (4,6-dimethyl-N-[3-(5-phenyl-lH-pyrazol- 
3-yl)-lH-l,2,4-triazol-5-yl]pyrimidin-2-amine), and molecule C20209924 (5-methyl-N-{2-[3-(3-pyridinyl)-lH-l,2,4-triazol-5-yl]ethyl}-2-indolinecarboxamide), respec- 
tively. The first molecule is an example of molecule with an orientation suited for the interaction with both the acidic loop and the surrounding structural components (VI 43 
and VI 47), while the latter is only interacting with the acidic loop and belongs to the subset of compounds that has been discarded upon the final selection step with API 
tools. 




catalytic cleft. The aromatic ring of molecule C08743791 (4,6-dimethyl-N-[3-(5- 
Fig. 6. Interactions between the selected molecules and Dl 08 and El 09 residues of phenyl-lH-pyrazol-3-yl)-lH-l,2,4-triazol-5-yl]pyrimidin-2-amine) is reported as 
the acidic loop. Molecule C08743791 (4,6-dimethyl-N-[3-(5-phenyl-lH-pyrazol-3- an example of interaction with VI 43 and VI 47 located on helix a3 of Cdc34. 
yl)-lH-l,2,4-triazol-5-yl]pyrimidin-2-amine) is reported as an example and the 
hydrogen bonds with D108 and El 09 of the acidic loop are highlighted. 




Fig. 7. Interactions between the selected molecules and S139. (Left panel) Aromatic rings are highlighted as a recurrent feature of the high-ranking molecules. (Right panel) 
The aromatic ring of compound C29375629 (2-(2-bromo-4,6-difluoro-anilino)-2-oxo-ethyl]) is shown to interact with SI 39. 
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Fig. 9. Pharmacophore model for Cdc34 inhibitors. The three main areas required for the inhibition of the acidic loop of Cdc34 are highlighted. 
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Fig. 10. Binding free energy of the top four compounds of the post-clustering ranking. The first bar (light blue) refers to the total binding free energy (full list reported in 
Table 1), the second bar (light yellow) is the internal energy obtained as the sum of the H-bonds (and VdW contributions) (olive green) and the electrostatic component 
(gray). The last column (red) refers to the torsional energy. Binding free energy = internal energy + torsional energy. (For interpretation of the references to color in this figure 
legend, the reader is referred to the web version of this article.) 



484 A Arrigoni et al./FEBS Open Bio 4 (2014) 473-484 



has been reported to be a crucial residue for Cdc34-mediated ubiq- 
uitination [28]. An aromatic ring able to interact with a valine clus- 
ter of Cdc34 also contributes to the molecular scaffold. 

5. Conclusion 

E2 enzymes act as mediators in determining the ultimate fate of 
the substrates selected for ubiquitination and for this reason they 
are considered privileged sites of action for inhibitory compounds. 

In the present work, we performed a virtual screening proce- 
dure to select compounds, which may act as inhibitor for the cat- 
alytic activity of Cdc34, an E2 enzyme who has been directly 
linked to tumorigenesis [14-19]. In particular, the molecules were 
selected to target Cdc34 acidic loop, a structural component that 
has been shown to modulate Cdc34 enzymatic activity [13]. 

One of the problems that typically affect virtual screening pro- 
cedures performed on thousands of compounds is the high number 
of false positives which lowers confidence of calculations results. 
To overcome this bias, we used at first DOCK Blaster, whose pur- 
pose is to provide a fast but coarse analysis of the entire Zinc sub- 
set, followed by Autodock calculations performed on the 500 high- 
ranking compounds selected by DOCK Blaster. After further refine- 
ment of the docking results, we have built a pharmacophore model 
from the ranked list of selected molecules, which can be a useful 
scaffold to develop and test inhibitory molecules for Cdc34in 
future experimental studies. 
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